-
-
Notifications
You must be signed in to change notification settings - Fork 113
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Document binomial-logit GLM #678
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This one needs a fair amount of work to get it up to the standard of our other doc in terms of naming arguments. There's also some inconsistencies in notation to iron out (little vs. big N for example in sizing, for example).
|
||
### Probability mass function | ||
|
||
Suppose $N \in \mathbb{N}$, $x\in \mathbb{R}^{n\cdot m}, \alpha \in \mathbb{R}^n, \beta \in \mathbb{R}^m$, and $n \in |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I would start by saying that M is the number of predictors and N is the number of data items.
Then we need the notation to match with caps and to use \times not \cdot
- x is in R^(M x N) (with \times, not \cdot)
- \alpha in \mathbb{R}
- \beta \in \mathbb{R}^M
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Then we need the notation to match with caps and to use \times not \cdot
That's also inconsistent with the other _glm
docs:
### Probability mass function | ||
|
||
Suppose $N \in \mathbb{N}$, $x\in \mathbb{R}^{n\cdot m}, \alpha \in \mathbb{R}^n, \beta \in \mathbb{R}^m$, and $n \in | ||
\{0,\ldots,N\}$. Then \begin{align*} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Start the begin{align*}
on its own line and line up the text under it so that it's readable. Looks like this accidentally got collapsed into a paragraph.
In the Stan doc, if we have an M x N matrix, we've conventionally used [m, n] for indexing, not [i, j].
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In the Stan doc, if we have an M x N matrix, we've conventionally used [m, n] for indexing, not [i, j].
That's not the case for the _glm
docs
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Then we should change the _glm
docs to be consistent with the rest of the User's Guide. I never wrote down hard and fast style rules and things like the symbol for expectation starts drifting under multiple authors. I mean to go and do a consistency fix of the entire doc set soon.
<!-- real; binomial_logit_glm_lpmf; (int n | int N, matrix x, real alpha, vector beta); --> | ||
\index{{\tt \bfseries binomial\_logit\_glm\_lpmf }!{\tt (int n \textbar\ int N, matrix x, real alpha, vector beta): real}|hyperpage} | ||
|
||
`real` **`binomial_logit_glm_lpmf`**`(int n | int N, matrix x, real alpha, vector beta)`<br>\newline |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Like our other function doc, this should name the arguments. alpha is an intercept, beta is a vector slopes, x is the data matrix, N is the total count and n is the count of successes.
Now if x
is a matrix, doesn't n
and N
have to be 1D arrays?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Like our other function doc, this should name the arguments. alpha is an intercept, beta is a vector slopes, x is the data matrix, N is the total count and n is the count of successes.
That would be inconsistent with the current doc for other _glm
functions:
Now if x is a matrix, doesn't n and N have to be 1D arrays?
No, they would be broadcast to match. This is the same way in the bernoulli_logit_glm
doc
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Then normal_id_glm
and bernoulli_logit_glm
should be fixed to match the rest of our doc. Given that we've thrown out consistency, there's no need for any new GLM to be consistent with the other GLM doc. I'd rather it be consistent with the rest of our doc as that's where it's going to go in the end.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I should've added that this doesn't need to be done as part of this PR. If you leave the new _glm
like the other GLM code, I can just fix it in a pass to make everything consistent again.
\index{{\tt \bfseries binomial\_logit\_glm\_lpmf }!{\tt (int n \textbar\ int N, matrix x, vector alpha, vector beta): real}|hyperpage} | ||
|
||
`real` **`binomial_logit_glm_lpmf`**`(int n | int N, matrix x, vector alpha, vector beta)`<br>\newline | ||
The log Binomial probability mass of n given N trials and chance of success |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Don't need to say "mass" here---it's just a probability. Or it's a "log probability mass function" if you want to spell it all out.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
"probability mass" is used throughout the other function docs:
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'd argue that "probability mass" is not idiomatic in this context because we just call the resulting quantity "probability" not a "probability mass". It's a "probability mass function" but it returns a probability not a probability mass. So this is another case where the binomial, Bernoulli, etc. need to be fixed. No worries if you can't get to it in this PR.
<!-- real; binomial_logit_glm_lupmf; (int n | int N, matrix x, vector alpha, vector beta); --> | ||
\index{{\tt \bfseries binomial\_logit\_glm\_lupmf }!{\tt (int n \textbar\ int N, matrix x, vector alpha, vector beta): real}|hyperpage} | ||
|
||
`real` **`binomial_logit_glm_lupmf`**`(int n | int N, matrix x, vector alpha, vector beta)`<br>\newline |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It doesn't make sense for x to be a matrix and n and N to be scalars. Is there interpretation here that you are broadcasting the n and N for all of the rows of x?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That's right, this is consistent with the signatures for bernoulli_logit_glm
and normal_id_glm
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks. I think it'd help clarify this in the doc. For instance, line 315 says "The log normal probability density of y
given location alpha + x * beta
and scale sigma
." but in this case y
is a vector and alpha + beta * x
is a vector, so calling a vector a location seems to violate agreement (plural/singular).
It would be nice if an updated version of this was merged before the release next week. @andrjohns do you have the time? |
@bob-carpenter are there changes you still think are required for this PR, or are they all things which can/should be done in follow ons? |
I confirmed with @bob-carpenter the remaining issues here can be a follow on. I've opened #705 for them. |
Submission Checklist
`r since("VERSION")`
Summary
This PR adds documentation for the
binomial_logit_glm
GLM distribution added in this PR. The implementation & likelihood/gradients are (unsurprisingly) very similar to thebernoulli_logit_glm
distribution, so I've based the documentation on that entry.Let me know if I've missed anything!
Copyright and Licensing
Please list the copyright holder for the work you are submitting (this will be you or your assignee, such as a university or company): Andrew Johnson
By submitting this pull request, the copyright holder is agreeing to license the submitted work under the following licenses: